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GENERATION OF HUMAN INSULIN 

This is a continuation-in-part of U.S. Serial No 
08/175,298, filed December 29, 1993. 

Bac)ccrrf »y™ri o f the Invention 



Throughout this specification, various publications are 
referenced by Arabic numerals within parentheses. Full 
citations for these references may be found at the end of 
the specification immediately preceding the claims . The 
15 disclosures of these publications in their entireties are 

hereby incorporated by reference into this specification in 
order to more fully describe the state of the art to which 
this invention pertains. 

20 Insulin is a polypeptide hormone essential for the control 

of glucose metabolism and it is administered daily to 
patients suffering from diabetes mellitus, a metabolic 
disorder characterized by an inadequate supply of insulin. 

25 In vivo . the hormone is first synthesized as a long 

precursor molecule, subsequently processed to its 
biologically active form, consisting of an A and a B chain. 
In more detail, the gene for preproinsulin is transcribed in 
the beta cells of the endocrine pancreas into an mRNA 

30 precursor, which is then spliced to produce mature mRNA. 

This mRNA is translated into preproinsulin (NH 2 -preregion-B 
chain-C peptide-A chain-COOH) , which is sequentially 
processed into proinsulin and finally into insulin. The 
first step in the processing is the proteolytic elimination 

35 of the preregion, which serves as a hydrophobic signal 

sequence for the transfer of the nascent chain through the 
microsomal membranes of the rough endoplasmatic reticulum. 
In human preproinsulin, the length of the preregion is 24 
amino acids . 



V. 
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In proinsulin, the two regions of the polypeptide chain that 
will become the mature insulin, the B- and A chains, are 
connected to each other by the C peptide (or C-chain) , which 
comprises at the N and C termini two pairs of basic amino 
5 acids. In most C-peptides, these pairs are Arg-Arg and Lys- 

Arg. The human C peptide, including the two flanking pairs 
of basic amino acids, contains 3 5 amino acids. The C 
peptide connects the two portions of the polypeptide in 
order to aid in appropriate disulfide bridge formation 
10 between the B and A segments. Therefore the role of the C 

peptide does not depend greatly on its structure. In fact, 
its replacement by a shorter synthetic bridge still allows 
proper folding of the proinsulin molecule (1,2) . 

15 The proinsulin folds with the concomitant oxidation of two 

interchain disulfide bonds and of one disulfide bond within 
the A chain. In the last stage of maturation, proteolytic 
enzymes cleave at the basic amino acids to release the C 
peptide and form the mature insulin (3) . In human insulin, 

20 the A chain is 21 amino acids long while the B chain is 30 

amino acids long. 

World demand for insulin exceeds several tons annually and 
there is a severe shortage of supply. Traditionally, 
25 insulin was produced from limited animal sources, mainly 

bovine and porcine pancreatic preparations, which differ 
from human insulin and may elicit an adverse immune 
reaction. 

30 Studies carried out during the 1960's demonstrated in vj,trp 

production of insulin. Insulin synthesis was achieved by 
combining the A and B chains in their S- sulfonated forms (4) 
or by the spontaneous reoxidation of reduced proinsulin (5) . 
The latter method was not practical for large scale insulin 

35 production due to very low protein concentration in the 

oxidation mixture. Insulin could subsequently be recovered 
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following treatment with trypsin and carboxypeptidase B (6) . 

Semi -synthetic and biosynthetic (recombinant) human insulin 
have recently become available. Semi -synthetic human 
5 insulin is produced from poircine insulin by the trypsin 

catalyzed exchange of alanine with threonine at position 30 
of the B chain (the only difference between porcine and 
human insulin) . The recombinant human insulin produced 
either in E. coli or yeast will eventually replace all other 
10 routes of manufacture. 



Biosynthetic recombinant human insulin is currently 
manufactured by two routes: either by producing the A and B 
chains separately in E . coli and subsequently combining them 
15 (7,8), or by enzymatic conversion of pro-insulin like 

polypeptides expressed in either E. coli ( 1,8) or yeast 
(2,9) . 

In most cases proinsulin is produced as a hybrid protein 
20 which accumulates as intracellular precipitated protein. 

This hybrid is normally purified and cleaved by CNBr in 
order to release the proinsulin polypeptide. The latter is 
further modified by oxidative sulf itolysis to proinsulin S- 
sulfonate. The proinsulin S-sulfonate is then purified and 
25 folded, under reducing conditions, to proinsulin (8) . 

Conversion of the proinsulin to insulin is achieved by the 
combined action of trypsin and carboxypeptidase B (6) . 

Patent Publication No. EP 195691 Bl, assigned to Novo 
3 0 Nordisk A/S describes a proinsulin of the formula B-Lys-Arg- 

A and the use thereof for the preparation of insulin in 
yeast . 

Patent Publication No. EP 196056 Bl, assigned to Chiron 
35 Corp., describes an hSOD-proinsulin protein produced by 

yeast. The hSOD-proinsulin protein is subjected to cyanogen 
bromide cleavage and sulf itolysis prior to folding. 
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Hoechst discloses in EPO Publication No. 379162 that 'false 
recombinants of insulin precursors' (i.e. recombinant 
insulin products with incorrect or partially incorrect 
intermolecular disulfide bridges) can be converted to 
5 'correct' insulin products without sulfitolysis by reacting 

the false recombinants with excess mercaptan in an aqueous 
medium in the presence of an organic redox system. The 
original sulfitolysis step takes place after the amino acid 
or peptide radical is cleaved off (chemically or 

10 enzymatically) from the fusion polypeptide (which takes 

place after lysis of the host cell) since then the six 
cysteines of the insulin precursor are converted into their 
S-sulfonates . in a subsequent renaturing step, natural 
proinsulin is produced from this proinsulin S-sulfonate by 

15 formation of the three correct disulfide bridges. During 

this renaturing step, the so-called 'false recombinants' are 
produced. 

Hoechst further discloses, in PCT International Publication 
20 No. WO 91/03550, a process for the preparation of fusion 

proteins containing a desired protein (e.g. proinsulin) and 
a "ballast constituent". Sulfitolysis is carried out before 
folding while the "ballast constituent" is cleaved off 
concomitantly with the C-chain of the proinsulin, after 
25 folding. 

In addition, Hoechst describes in EP 347781 Bl , a "mini- 
proinsulin" (B-Arg-A) and the use thereof for the 
preparation of mono-Arg insulin and insulin. They further 
30 describe fusion proteins which comprise B-Arg-A and a 

"ballast constituent". The "ballast constituent" is cleaved 
off by cyanogen bromide and sulfitolysis is carried out 
before folding of the polypeptide. 

35 The subject invention discloses recombinant human insulin 

production by an improved and efficient process. 
Recombinant proinsulin hybrid polypeptides comprising a 
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leader sequence are synthesized in E.coli . After partial 
purification, they are folded with the leader peptide still 
attached under conditions which permit correct folding. 
Biologically active human insulin is then produced by 
5 combined treatment with trypsin and carboxypeptidase B in 

which these enzymes cleave off the leader peptide and the C- 
chain concomitantly. The purified human insulin thus 
produced is identical to naturally occurring human insulin. 



10 The hazardous and cumbersome procedures involved in CNBr 

cleavage of hybrid polypeptides and sulfitolysis used to 
protect the abundant SH groups are excluded from this novel 
process since the entire proinsulin hybrid polypeptide can 
fold efficiently into its native structure even in the 

15 presence of the leader peptide and the unprotected cysteine 

residues. The active recombinant human insulin is released 
by enzymatic cleavage and is thereafter purified. 
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Brief Description of th » Figures 

The restriction maps for the three plasmids shown in Figures 
3-5 do not identify all restriction sites present on these 
5 plasmids. However, those restriction sites necessary for a 

complete understanding of the invention, are shown. 

Figure It Human insulin generation by enzymatic 

cleavage of the folded, disulfide bonded proinsulin hybrid 
10 polypeptide produced by expression of plasmid pBAST-R. Only 

part of the SOD leader sequence is indicated. 

Figure 2: Human insulin generation by enzymatic 

cleavage of the folded, disulfide bonded proinsulin hybrid 
15 polypeptide produced by expression of plasmid pDB AST - LAT or 

plasmid pJtBAST-LAT. Only part of the SOD leader sequence is 
indicated. 

Figure 3: Structure of plasmid pBAST-R, an expression 

20 plasmid encoding an SOD-proinsulin hybrid polypeptide 

deposited with the ATCC under ATCC Accession No. 69362. 

Figure 4: Structure of pDBAST-LAT, an expression 

plasmid encoding an SOD-proinsulin hybrid polypeptide 
25 deposited with the ATCC under ATCC Accession No. 69361. 

Figure 5: Structure of pXBAST-LAT, an expression 

plasmid encoding an SOD-proinsulin hybrid polypeptide 
deposited with the ATCC under ATCC Accession No. 69363. 



30 



Figure 6; Amino acid and corresponding DNA nucleotide 

sequence of the SOD-proinsulin hybrid polypeptide expressed 
by plasmid pB AST - R . 



35 



Figure 7: Amino acid and corresponding DNA nucleotide 

sequence of the SOD-proinsulin hybrid polypeptide expressed 
by plasmids pDBAST-LAT and pABAST-IAT. 
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Figure 8: Human insulin product ion. from the pro insulin 

hybrid poly peptide expressed by plasmid pBAST-R. 
as a function of the pH o f the folding mixture. 

5 Folding of the proinsulin hybrid polypeptide (produced as 

described in Example 2) was performed at various pH's as 
indicated in lOOmM glycine buffer at 4°C for about 16 hours 
with either 1 mg/ml or 0.5 mg/ml of the hybrid polypeptide. 
The folded material was treated with trypsin (1:500 w/w) 
10 (Sigma) and carboxypeptidase B (CPB, Sigma, 1:200 w/w) for 

3 0 minutes at 37° at pH 9 and assayed for immunoreactive 
(IR) insulin by radioimmunoassay utilizing 125 I -insulin 
(Amersham) and human recombinant insulin (Calbiochem) as 
standard. 

15 

Figure 9: Human j u\ j n pr ofly p fcjoja tXSB — tim — proinsulin 

hybrid polypeptide expressed bv plasmid pDBAST- 
LAT 

20 The proinsulin hybrid polypeptide (produced as described in 

Example 2) was dissolved in 8M urea, 5mM HC1 at a 
concentration of about 3 0 mg/ml and diluted to 1 mg/ml in 
lOOmM glycine-NaOH, pH 11.0. Folding was carried out at 
22°C (room temperature) for 20 hours. The solution was then 

25 adjusted to pH 8.8 with HC1 . Carboxypeptidase B (1:1000 

w/w, Sigma) and trypsin (1:2000 w/w, Sigma) were added and 
the reaction mixture was incubated at 37°C for 60 minutes. 
Digestion mixtures were acidified to pH 3 before being 
diluted with 10 mM HCl . 150 pi aliquots were analyzed by 

3 0 Reverse Phase -High Pressure Liquid Chromatography (RP-HPLC) 

on a 250x4 mm, 5/x Lichrosphere 100 RP-8 column (Merck) which 
was equilibrated with 50 mM tetraethylammonium phosphate, 
162mM NaCl0 4 , pH 3, containing 31.5% (v/v) acetonitrile . 
The column was developed with a linear gradient of 31.5- 

35 40.5% acetonitrile during 75 minutes at a flow rate of 1 

ml/minute. Absorbance was monitored at 220 nm. 
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A: 5 M9 of standard insulin (Boehringer- 
Mannheim) ; 

B: recombinant human insulin produced following 
enzymatic treatment; 
5 C: folded SOD-proinsulin hybrid polypeptide. 

Figure 10: Human insulin p roduction from the proinsulin 

hybrid p olypeptide expressed by. — plagmjd 

pDBAST-LAT as a fun ction of the pH in the 

io foxing nuxtvre 

The proinsulin hybrid polypeptide (produced as described in 
Example 2) was diluted to 1 mg/ml in lOOmM glycine-NaOH 
buffer having the indicated pH values and was folded at 22°C 
15 for 16 hours. Enzyme treatment and RP-HPLC analysis was 

carried out as described in Figure 9. The amount of 
recombinant human insulin produced from the hybrid 
polypeptide was calculated according to the area of the peak 
which had the same retention time as standard insulin. 



20 



25 



Figure 11: Human insulin pro duction from the proinsulin 

hybrid polypeptide ex pressed bv plasmid 
oDBAST-LAT as a function of the ascorbic 
acid concentration in the fo lding mixture 



Folding of the SOD-proinsulin hybrid polypeptide (produced 
as described in Example 2) was carried out at 1 mg/ml in 
lOOmM glycine-NaOH at 22°C # pH 11.2 in the presence of the 
indicated concentrations of ascorbic acid. Samples were 
30 treated with trypsin and carboxypeptidase B (as in Figure 9) 

after 5 and 25 hour folding periods. Recombinant human 
insulin production was analyzed on RP-HPLC (as in Figure 9) . 



35 



Figure 12: 



A uthenticity of h uman insulin produced from 
the proinsulin h vbrid polypeptide expressed 
bv Plasmid pDBAST-IAT 
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25 



Folding of the SOD-proinsulin hybrid polypeptide (produced 
as described in Example 2) was carried out at 1 mg/tnl in 
lOOmM glycine-NaOH, pH 11.2 and 1.2mM ascorbic acid at 22°C 
for 16 hours. Following enzymatic treatment (as in Figure 
5 9) , the mixture was chroma tographed on a DEAE-Sepharose 

column equilibrated in 20mM Tris-HCl, pH 8. Recombinant 
human insulin was eluted with a linear gradient of 0-0. 4M 
NaCl in 20mM Tris-HCl, pH 8. Peak fractions were pooled and 
acidified with HCl to pH 3. The recombinant human insulin 

10 was further purified from insulin- like molecules by RP-HPLC 

as described for Figure 9. The major peak was collected, 
desalted on Sephadex G-25 column in 0.25M Acetic acid and 
lyophilized. Samples (5 /ig of recombinant human insulin) 
were prepared in lOmM HCl and were analyzed by RP-HPLC under 

15 the same conditions. 



A 
B 
C 



Standard insulin; 

HPLC purified recombinant human insulin ; 
Combined sample of HPLC purified recombinant 
human insulin and standard insulin. 



Figure 13: Human insulin product ign from the prpins^iUn 

hybrid polypeptide expressed bv plasmid 
pDBAST-kftT ae fl functjpn q£ protein 

concentration jp the folding t^^^ure 



SOD-proinsulin hybrid polypeptide (produced as described in 
Example 2) was folded in lOOmM Glycine-NaOH, pH 11.2 at a 
final protein concentration from 0.5 mg/ml to 10 mg/ml as 
j indicated. Each folding mixture was supplemented with 2.5 

30 moles ascorbic acid per mole SH group. Folding was carried 

out at 24°C (room temperature) for 16 hours. Enzymatic 
treatment and RP-HPLC analysis were performed as described 
for Figure 9 . 
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Pigura 14: Human insulin production from the prpjnsuljn 

hybrid polypeptide exp ressed bv plasmid 

pDBAST-LAT from crude j.pt;racel lular 

precipitate a s a function of folding time 

Intracellular precipitate was dissolved in 20mM Glycine - 
NaOH, 33/iM EDTA, pH 11.2 at a concentration of about 2.6 
A 2S0 P er ral - The P H was adjusted to 12 with ION sodium 
hydroxide. The solution was left stirring for 10 minutes. 
The pH was titrated to 11.2 with concentrated hydrochloric 
acid. Activated charcoal (acid washed, Sigma) was added to 
0.1% w/v final concentration and the mixture was stirred for 
30 minutes. The suspension was centrifuged (20 min., 12000 
rpm) at 20 °C. The clarified supernatant had an A 280 of 
about 2.15. Ascorbic acid was supplemented to 3 mM final 
concentration. Folding of the proinsulin hybrid polypeptide 
was carried out as shown, with vigorous stirring at room 
temperature (22-23 °C) . At various time points along the 
experiment (starting from dissolution) 10 ml aliquots were 
withdrawn, titrated to pH 8.8 and digested with 
carboxypeptidase B (1:1000 w/w) and trypsin (1:2000 w/w) for 
1 hour at 37°C in the presence of 50/zM ZnCl 2 . Digestion was 
terminated by acidification. Insulin content in each 
digested sample was determined by RP-HPLC analysis as 
described in Figure 9. The progress of the folding reaction 
is manifested by the increase of insulin (after digestion) , 
and the decrease in the level of free thiol groups, the 
latter being assayed by the Ellman reaction (16) . 
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The subject invention provides a method of producing human 
insulin which comprises folding a hybrid polypeptide 
5 comprising proinsulin under conditions that permit correct 

disulfide bond formation, subjecting the folded, disulfide 
bonded hybrid polypeptide to enzymatic cleavage to produce 
active human insulin, and purifying the active human 
insulin. 

0 

The subject invention further provides a polypeptide 
comprising proinsulin and a leader peptide attached to the 
N-terminus of the proinsulin, wherein the polypeptide is 
folded and contains correct disulfide bonds. 
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The plasmids pBAST-R, pDBAST-LAT and pABAST-LAT were 
deposited in E. coli pursuant to, and in satisfaction of, 
5 the requirements of the Budapest Treaty on the International 

Recognition of the Deposit of Microorganisms for the 
Purposes of Patent Procedure with the American Type Culture 
Collection (ATCC) , 12301 Parklawn Drive, Rockville, Maryland 
20852 under ATCC Accession Nos. 69362, 69361 and 69363 
10 respectively on July 26, 1993. 

As used herein, a hybrid polypeptide comprises a leader 
peptide covalently attached to a desired polypeptide. The 
hybrid polypeptide of the subject invention comprises 
15 proinsulin, and preferably comprises SOD as the leader 

peptide - 

As used herein, folding comprises folding of a hybrid 
polypeptide comprising proinsulin without CNBr cleavage 

2 0 before folding and without sulfitolysis before folding to 

protect SH groups, wherein the folding permits correct 
disulfide bond formation in the hybrid polypeptide. 

As used herein, correct disulfide bond formation of the 
25 hybrid polypeptide comprises the formation of three 

disulfide bonds between Cys B7 -Cys A7 , Cys B19 -Cys A2 °, and Cys A6 - 
Cys* 11 of insulin (Cys residues are numbered according to 
their numbering in mature insulin) . 

3 0 As used herein, proinsulin comprises a polypeptide 

comprising, from N-terminal to C- terminal order, the B, C 
and A chains of insulin. 

As used herein, the C-chain peptide of insulin comprises the 
3 5 naturally-occurring C-peptide and any other oligopeptide, 

dipeptide or single amino acid which can be cleaved off by 
trypsin and carboxypeptidase B. 
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As used herein, a leader peptide comprises any peptide or 
polypeptide covalently attached to the B chain of insulin 
which permits folding and disulfide bond formation and which 
can be cleaved off by means of trypsin. The leader peptide 
5 is preferably SOD. 

As used herein, SOD comprises any substantial part of the 
amino acid sequence of CuZnSOD or MnSOD and said part does 
not necessarily have the biological activity of SOD nor does 
10 it necessarily have the identical amino acid sequence of 

such a part compared to the amino acid sequence of 
naturally-occurring SOD. The DNA encoding the SOD may be 
mutated by methods known to those skilled in the art, e.g. 
Bauer et al. (1985), Gene 37: 73-81. 

15 

The leader peptide may comprise, instead of SOD, any other 
peptide, polypeptide or protein or any substantial part of 
the amino acid sequence of such a peptide, polypeptide or 
protein wherein said part does not necessarily have the 

20 biological activity of said peptide, polypeptide or protein 

nor does it necessarily have the identical amino acid 
sequence of such a part compared to the amino acid sequence 
of the naturally-occurring peptide, polypeptide or protein; 
however the leader peptide must permit folding and correct 

25 disulfide bond formation of the hybrid polypeptide. 

As used herein, insulin may comprise a homolog of naturally 
occurring insulin. 

3 0 As used herein, proinsulin may comprise a homolog of 

naturally occurring proinsulin. 

As used herein, the term M homolog M relating to the insulin 
polypeptide produced by the methods of the subject 
35 invention, is a polypeptide which has substantially the same 

amino acid sequence and substantially the same biological 
activity as insulin. Thus, a homolog may differ from the 
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insulin polypeptide produced by the methods of the invention 
by the addition, deletion, or substitution of one or more 
non-essential amino acid residues, provided that the 
resulting polypeptide retains the biological activity of 
5 insulin- Persons skilled in the art can readily determine 

which amino acids residues may be added, deleted, or 
substituted (including with which amino acids such 
substitutions may be made) using established well known 
procedures, including, for example, conventional methods for 

10 the design and manufacture of DNA sequences coding for 

bacterial expression of polypeptide homologs of the subject 
polypeptide, the modification of cDNA and genomic sequences 
by site-directed mutagenesis techniques, the construction of 
recombinant proteins and expression vectors, the bacterial 

15 expression of the polypeptides, and the measurement of the 

biochemical activity of the polypeptides using conventional 
biochemical assays. 

The above definition of homologs of insulin applies equally 
2 0 to homologs of proinsulin. 

Examples of homologs of insulin produced by the methods of 
the subject invention are deletion homologs containing less 
than all the residues of naturally-occurring insulin, 
25 substitution homologs wherein one or more residues specified 

are replaced by other residues, and addition homologs 
wherein one or more amino acids residues are added to a 
terminal or medial portion of the insulin polypeptide, all 
of which share the biological activity of insulin. 



30 



35 



Examples of homologs are the insulin analogs disclosed in 
EPO Patent Application EP 384472 and also the insulin analog 
"Humalog" of Eli Lilly as disclosed in "Eli Lilly and 
Company Report to Shareholders 1992". 

Substantially the same amino acid sequence is herein defined 
as encompassing substitutions and/or deletions and/or 
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additions of amino acids in the amino acid sequence and may 
encompass up to ten (10) residues in accordance with the 
homologous or equivalence groups as described by e.g. Albert 
L. Lehninger, Biochemistry, second edition, Worth Publishers 
5 Inc. (1975), Chapter 4; Creighton, protein Structure, a 

Practical Approach, IRL Press at Oxford University Press, 
Oxford, England (1989); and Margaret 0. Dayhoff, Atlas of 
Protein Sequence and Structure, Volume 5, The National 
Biomedical Research Foundation (1972), Chapter 9. Such 
10 substitutions are known to those skilled in the art. 



The DNA encoding the insulin polypeptide may be mutated by 
methods known to those skilled in the art, e.g. Bauer et al. 
(1985), Gene 17: 73-81. The mutated sequence may be 
15 inserted into suitable expression vectors as described 

herein, which are introduced into cells which are then 
treated so that the mutated DNA directs expression of the 
polypeptide homolog. 

20 The plasmids of the subject invention comprising a sequence 

encoding a hybrid polypeptide comprising proinsulin may be 
adapted for expression in bacteria, yeast, fungi or 
mammalian cells such as CHO, chicken embryo, fibroblast or 
other known cell lines which additionally comprise the 

25 regulatory elements necessary for expression of the cloned 

gene in the bacteria, yeast, fungi or mammalian cells, so 
located relative to the nucleic acid encoding the hybrid 
polypeptide, in order to permit expression thereof. 
Regulatory elements required for expression include promotor 

30 sequences to bind RNA polymerase and a ribosomal binding 

site for ribosome binding. 

The plasmids of the subject invention express a hybrid 
polypeptide comprising proinsulin. 

35 

Those skilled in the art will understand that the plasmids 
deposited in connection with this application may be readily 
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altered by known techniques (e.g. by site-directed 
mutagenesis or by insertion of linkers) to encode expression 
of homologous polypeptides. Such techniques are described 
for example in Sambrook, J., Fritsch, E . F. and Maniatis, T. 
(1989) Molecular Cloning: A Laboratory Manual, 2nd edition, 
Cold Spring Harbor Laboratory Press. 

The suitable regulatory elements are positioned within the 
plasmid relative to the DMA encoding the hybrid polypeptide 
comprising proinsulin, so as to effect expression of the 
hybrid polypeptide in a suitable host cell. In preferred 
embodiments of the invention, the regulatory elements are 
positioned close to and upstream of the DNA encoding the 
hybrid polypeptide. 

Various ribosomal binding sites (RBS's), for rendering mRNA 
transcribed from DNA encoding a hybrid polypeptide 
comprising proinsulin capable of binding to ribosomes within 
the host cell, are also included in the subject invention, 
such as the deo RBS. 

The plasmids of the invention also contain an ATG initiation 
codon. The DNA encoding the hybrid polypeptide comprising 
proinsulin is in phase with the ATG initiation codon. 

The plasmids of the invention also include a DNA sequence 
comprising an origin of replication from a bacterial plasmid 
capable of autonomous replication in the host cell. 
Suitable origins of replication may be obtained from 
0 numerous sources, such as from plasmid pBR322 (ATCC 

Accession No. 37017) . 

The plasmids of the subject invention also include a DNA 
sequence which contains a gene associated with a selectable 
5 or identifiable phenotypic trait which is manifested when 

the plasmid is present in the host cell such as a drug 
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resistance gene, e.g. resistance to ampicillin, 
chloramphenicol or tetracycline. 

Examples of vectors that may be used to express the nucleic 
5 acid encoding the hybrid polypeptides (comprising 

proinsulin) are viruses such as bacterial viruses, e.g., 
bacteriophages (such as phage lambda) , cosmids, plasmids and 
other vectors. Genes encoding hybrid polypeptides comprising 
proinsulin are inserted into appropriate vectors by methods 

10 well known in the art. For example, using conventional 

restriction endonuclease enzyme sites, inserts and vector 
DNA can both be cleaved to create complementary ends which 
base pair with each other and are then ligated together with 
a DNA ligase. Alternatively, synthetic linkers harboring 

15 base sequences complementary to a restriction site in the 

vector DNA can be ligated to the insert DNA, which is then 
digested with the restriction enzyme which cuts at that 
site. Other means are also available. 

20 Preferred bacterial host cells are E. coli cells. Examples 

of suitable E.coli cells are strains S0733 (cytRstrA) or 
4300, but other E. coli strains and other bacteria can also 
be used as hosts for the plasmids. 

25 The bacteria used as hosts may be any strain including 

auxotrophic (such as A1645) , prototrophic (such as A4255) , 
and lytic strains; F* and F~ strains; strains harboring the 
cI857 repressor sequence of the X prophage (such as A1645 
and A4255) and strains devoid of the deo repressors and/or 

30 the deo gene (see European Patent Application Publication 

No. 0303972, published February 22, 1989). E . coli strain 
S#733 and E . coli strain 4300 have been deposited under ATCC 
Accession Nos . 69361 and 69363 respectively. 

35 All the E. coli host strains described above can be "cured" 

of the plasmids they harbor by methods well known in the 



96/20724 PCT/US94/13268 

-18- 

art, e.g. the ethidium bromide method described by R.P. 
Novick in Bacteriol . Review 33 , 210 (1969) . 

The subject invention provides a method of producing insulin 
which comprises folding a hybrid polypeptide comprising 
proinsulin under conditions that permit correct disulfide 
bond formation, subjecting the folded, disulfide bonded 
hybrid polypeptide to enzymatic cleavage to produce insulin, 
and purifying the insulin. The insulin has the activity and 
properties of commercially available human insulin. 

In a preferred embodiment, the folding comprises incubating 
the hybrid polypeptide at about 4-37°C for a period of about 
1-30 hours at a pH of about 8.5-12.0. 

In another preferred embodiment, the folding comprises 
incubating the hybrid polypeptide at about 4-37°C for a 
period of about 1-30 hours at a pH of about 8.5-12.0 in the 
presence of ascorbic acid. 

In an especially preferred embodiment the pH during folding 
is 11.0-11.25. 

In another especially preferred embodiment the concentration 
of ascorbic acid is about 2 moles per mole SH group present 
in the folding mixture. 

In yet another embodiment the incubation period is about 5 
hours . 



In another embodiment the subjecting comprises adjusting the 
pH to about 8.8-9.0 and cleaving the hybrid polypeptide with 
trypsin and carboxypeptidase B at 16-37°C for about 30 
minutes to 16 hours. 

In another embodiment, the purifying comprises DEAE- 
Sepharose chromatography and RP-HPLC. 
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In yet another embodiment, the purifying further comprises 
ultrafiltration and CM-Sepharose chromatography. 

In an especially preferred embodiment, the purifying further 
5 comprises DEAE-Sepharose chromatography and Phenyl -Sepharose 

chromatography . 

In an especially preferred embodiment, the hybrid 
polypeptide is expressed by plasmid pDBAST-LAT deposited 
10 under ATCC Accession No. 69361. 

In another preferred embodiment the hybrid polypeptide is 
expressed by plasmid pXBAST-LAT deposited under ATCC 
Accession No. 69363. 

15 

In another embodiment the hybrid polypeptide is expressed by 
plasmid pBAST-R deposited under ATCC Accession No. 69362. 

In a preferred embodiment the hybrid polypeptide is obtained 
20 by treating a bacterial cell containing DNA encoding the 

hybrid polypeptide, so that the DNA directs expression 
thereof and recovering the hybrid polypeptide from the cell. 

It is envisaged that the treating comprises fermentation in 
25 the presence of glucose, glycerol or galactose. 

It is further envisaged that the recovery of the hybrid 
polypeptide from the cell comprises disrupting the cell wall 
of the bacterial cell or fragments thereof to produce a 
3 0 lysate, isolating intracellular precipitate from the lysate 

by centrifugation, solubilizing the precipitate and 
optionally purifying the hybrid polypeptide by 
chromatography or ultrafiltration. 



35 



The subject invention further provides a polypeptide 
comprising proinsulin and a leader peptide attached to the 
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N-terroinus of the proinsulin, wherein the polypeptide is 
folded and contains correct disulfide bonds. 

In a preferred embodiment the leader peptide is derived from 
the N-terminus of CuZnSOD. 

In an especially preferred embodiment the leader peptide 
comprises 62 amino acids, being preceded by the amino acid 
Met and followed by an Arg residue. 

In a preferred embodiment the proinsulin comprises the 
insulin B-chain linked to the insulin A chain by a single 
Arg residue. 

15 In another embodiment, the proinsulin comprises the insulin 

B-chain linked to the insulin A chain by the dipeptide Lys- 



10 



The above two proinsulin molecules have to be produced as 
20 hybrid proteins, otherwise expression levels are extremely 

low and not of commercial significance. 

In all preferred embodiments the cysteine residues of the 
leader peptide have been replaced by serine residues. 

25 



WO 96/20724 



-21- 



PCT/US94/13268 



Examples 

The Examples which follow are set forth to aid in 
understanding the invention but are not intended to, and 
should not be construed to, limit its scope in any way. The 
Examples do not include detailed descriptions for 
conventional methods employed in the construction of 
vectors, the insertion of genes encoding polypeptides into 
such vectors or the introduction of the resulting plasmids 
into hosts. The Examples also do not include detailed 
description for conventional methods employed for assaying 
the polypeptides produced by such host vector systems . Such 
methods are well known to those of ordinary skill in the art 
and are described in numerous publications including by way 
of example the following: 

Sambrook, J., Fritsch, E.F. and Maniatis, T. (1989) 
Molecular Cloning; A Laboratory Manual. 2nd edition, Cold 
Spring Harbor Laboratory Press. 
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confltruction of production plasmids pBftST-R, pPBAgT-uyT an<? 
pABAST-LAT expressing SOD-p r-oinsulin hybrid polypeptides 

5 

Bacterial expression vectors, which overproduce hybrid 
proteins in E. coli , under the control of either the flep 
p i p 2 or * P L promoter, were constructed- Proinsulin was 
produced as a hybrid protein since it was found that 

10 bacteria harboring an expression vector encoding Insulin B- 

chain-Lys-Arg- Insulin A-chain produced no detectable 
polypeptide. The hybrid proteins comprise a leader peptide, 
62 amino acids long, derived from the N-terminus of CuZnSOD 
(11) , preceded at the N-terminus by a Met residue and 

15 followed at the C-terminus by an Arg residue linking it to 

insulin B-chain. The insulin B-chain is linked to insulin 
A-chain by a short C-chain peptide consisting of Lys-Arg or 
Arg. The two cysteines originally present in the SOD 
portion were replaced by serine residues. 

20 

Plasmid pBAST-R 

A series of plasmids was constructed culminating in pBAST-R, 
which upon transformation of the proper E. coli host cells 
25 was capable of directing efficient expression of a 

proinsulin hybrid polypeptide useful for human insulin 
production. 

The structure of plasmid pBAST-R, encoding SOD- Insulin B 
3 0 chain-Lys-Arg- Insulin A chain hybrid polypeptide is shown in 

Figure 3 ; the DNA sequence and corresponding amino acid 
sequence of the hybrid polypeptide are shown in Figure 6 . 

Plasmid pBAST-R is about 43 80 bp long and comprises the 
35 following elements {in a counterclockwise direction) : 
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1. A DNA fragment, 1521 bp long, spanning Aat ll- Msc I sites 
on pBR322 which includes the tetracycline resistance 
gene . 

5 2. A DNA fragment, 1497 bp long, spanning S£aI-HsfiII sites 

on pBR322 which includes a truncated ampicillin 
resistance gene and the origin of DNA replication. 

3. A DNA fragment, 930 bp long, spanning Ava ll- Nde l sites 
10 on E. coli DNA which includes the deo P X P 2 promoters 

and ribosomal binding site (RBS) (13) . 

4. A DNA fragment, 188 bp long, spanning Nde l- PpuM I sites 
of human CuZnSOD cDNA. The cysteines at positions 6 

15 and 57 of mature SOD were substituted with serine 

residues by oligonucleotide site-directed mutagenesis 
<12) . 

5. A synthetic DNA fragment, 172 bp long, with PpuM I and 
20 BamH I ends. This region encodes Arg- insulin B chain- 

Lys-Arg- insulin A chain. 

6. A synthetic 36 bp multiple cloning site polylinker with 
BamH I and Hind i I I ends. 

25 

7. A synthetic 44 bp oligonucleotide containing the TrpA 
transcription terminator with HingUI and &&£II ends 
(10) . 

30 Plasmid pBAST-R, which confers tetracycline resistance and 

which encodes the SOD- Insulin B chain-Lys-Arg- Insulin A 
chain hybrid polypeptide, was introduced into E. coli strain 
S0733 (cytRstrA) and deposited in the ATCC under ATCC 
Accession Number 69362 on July 26, 1993. 

35 
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Plasmid pDBAST-LAT 

Another series of plasmids was constructed culminating in 
plasmid pDBAST-LAT, which upon transformation of the proper 
5 E. coli host cells was capable of directing efficient high 

level expression of a proinsulin hybrid polypeptide useful 
for human insulin production. 

The structure of plasmid pDBAST - LAT , encoding SOD- Insulin B 
10 chain-Arg- Insulin A chain hybrid polypeptide is shown in 

Figure 4; the DNA sequence and corresponding amino acid 
sequence of the hybrid polypeptide are shown in Figure 7. 
Plasmid pDBAST-LAT is about 4377 bp long and comprises the 
following elements (in a counterclockwise direction) : 

15 

1. A DNA fragment, 1521 bp long, spanning Aa£II-tl£cI sites 
on pBR322 which includes the tetracycline resistance 
gene . 

20 2. A DNA fragment, 1497 bp long, spanning fic^I-Ha^II sites 

on pBR322 which includes a truncated ampicillin 
resistance gene and the origin of DNA replication. 

3. A DNA fragment, 930 bp long, spanning £v&H-N£l£l sites 
25 on E. coli DNA which includes the deo P X P 2 promoters 

and RBS (13) . 

4. A DNA fragment, 188 bp long, spanning Nde l-PpuMI sites 
of human CuZnSOD cDNA. The cysteines at positions 6 

30 and 57 of mature SOD were substituted with serine 

residues and the GC content of this fragment was 
reduced to 38% by oligonucleotide site-directed 
mutagenesis (12) . 

35 5. A synthetic DNA fragment, 16 9 bp long, with PouMI and 

BamH I ends. This region encodes Arg- insulin B chain- 
Arg- insulin A chain. 



15 



30 
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6. A synthetic 36 bp multiple cloning site polylinker with 
BamH I and Hindi I I ends. 

7. A synthetic 44 bp oligonucleotide containing the TrpA 
5 transcription terminator with HincJIII and &a£II ends 

(10) • 

Plasmid pDBAST - LAT , which confers tetracycline resistance 
and which encodes the SOD-Insulin B chain-Arg-Insulin A 
10 chain hybrid polypeptide, was introduced into E. coli strain 

S0733 (cytRstrA) and deposited in the ATCC under ATCC 
Accession Number 69361 on July 26, 1993. 



Plasmi^ pAPAST-IAT 



Another series of plasmids was constructed culminating in 
plasmid pA. BAST -LAT, which upon transformation of genetically 
engineered E. coli host cells (harboring the cI857 
repressor) was capable of directing efficient expression of 
20 a proinsulin hybrid polypeptide useful for human insulin 

production. 

The structure of plasmid pABAST-LAT, encoding SOD-Insulin B 
chain-Arg-Insulin A chain hybrid polypeptide is shown in 
25 Figure 5. The DNA sequence and corresponding amino acid 

sequence of the hybrid polypeptide are shown in Figure 7. 

Plasmid pABAST-LAT is about 3777 bp long and comprises the 
following elements (in a counterclockwise direction) : 



1. A DNA fragment, 1521 bp long, spanning Aat ll-MscI sites 
on pBR322 which includes the tetracycline resistance 



35 



2 . 



A DNA fragment, 1497 bp long, spanning Seal- Hafe 1 1 sites 
on pBR322 which includes a truncated ampicillin 
resistance gene and the origin of DNA replication. 
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3. A DNA fragment, 33 0 bp long, spanning BamHI -EcoRI sites 
on plasmid pSOD<*13 (14) which includes the A.P L promoter 
and an A vr II ' Nde I 30 oase pair long deo ribosomal 
binding site, 

5 

4. A DNA fragment, 188 bp long, spanning Ftiel -ppyifll sites 
of human CuZnSOD cDNA. The cysteines at positions 6 
and 57 of mature SOD were substituted with serine 
residues and the GC content of this fragment was 

10 reduced to 38% by oligonucleotide site-directed 

mutagenesis (12) . 

5. A synthetic DNA fragment, 169 bp long, with PpuMI and 
BamH I ends. This region encodes Arg-insulin B chain- 

15 Arg-insulin A chain. 

6. A synthetic 36 bp multiple cloning site polylinker with 
BamH I and Hind lll ends. 

20 7. A synthetic 44 bp oligonucleotide containing the TrpA 

transcription terminator with Hindi I I and &a£II ends 
(10) . 

Plasmid pABAST-LAT, which confers tetracycline resistance 
25 and which encodes the SOD- Insulin B chain- Arg- Insulin A 

chain hybrid polypeptide under the control of the AP L 
promoter, was introduced into E. coli strain 4300 (F-, bio, 
cl 857 ) and deposited in the ATCC under ATCC Accession No. 
69363 on July 26, 1993. 



30 



Bacterial cells were propagated at 30°C. Production of the 
hybrid polypeptide was induced upon temperature shift to 
42°C. 
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Fermentation. grow th conditions and purification of SOP 
proinflulin hybrid polypeptides 



I. Stock Cultures 



Stock culture of E. coli strain S0733 harboring plasmid 
pDBAST-LAT {or pBAST-R) was grown on casein medium (20gr/L 
10 casein hydrolysate, 10gr/l yeast extract and 5gr/L NaCl) 

supplemented with tetracycline (lOmg/L) . The cultures were 
then diluted two- fold with freezing medium and stored at - 
B0°C. 

15 Freezing medium: 

K 2 HP0 4 6.3 gr 

KH 2 PC 4 1.8 gr 

Na Citrate 0.4 5 gr 

MgS0 4 .7H 2 0 0.09 gr 

20 <NH 4 ) 2 S0 4 0.9 gr 

Glycerol 4 4 gr 

Per 500 ml 



II. Inoculum 

25 

The inoculum was propagated in production medium (see 
below) . Sterile medium in a shake flask was inoculated from 
stock culture and incubated 15 hours on a shaker at 37°C and 
approximately 200 r.p.m. If needed, subsequent stages in 
30 inoculum propagation were carried out in stirred aerated 

fermenters. Sterile medium was inoculated with 2-10% flask 
culture, and incubated 15 hours at 37°C, pH 7+0.5 with 
agitation and aeration to maintain the dissolved oxygen 
level above 20% air saturation. 



35 
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III. Production 



Production medium: 

K 2 HP0 4 B gr/L 

5 KH 2 P0 4 2 gr/L 

Na citrate 2 gr/L 

NH 4 C1 3 gr/L 

K 2 S0 4 0.6 gr/L 

FeS0 4 .7H 2 O 0.04 gr/L 

10 MgS0 4 .7H 2 0 0.4 gr/L 

CaCl 2 .2H 2 0 0.02 gr/L 
Trace elements 

solution 3 ml/L 

Tetracycline 0.01 gr/L 

15 Glucose 2 gr/L 

Glycerol 1 ml/L 

Trace elements solution: 

MnS0 4 .H 2 0 1 gr/L 

20 ZnS0 4 .7H 2 0 2.78 gr/L 

CoCl 2 .7H 2 0 2 gr/L 

Na 2 Mo0 4 .2H 2 0 2 gr/L 

CaCl 2 .2H 2 0 3 gr/L 

CuS0 4 .5H 2 0 1.85 gr/L 

25 H3BO3 ^ 0.5 gr/L 

HC1 (32%) *" 100 ml/L 

The production medium was inoculated with 0.5-10% inoculum 
culture and incubated at 37°C. Agitation-aeration rates 
30 were set to maintain the dissolved oxygen level above 20% 

air saturation. The pH was maintained at 7+0.2 with NH 3 . 



Sterile solutions of 50% glucose and 30% glycerol were 
infused to supply energy and carbon sources. Once cell 
3 5 concentration reached an OD €60 of 25, sterile solutions of 

10% glucose and 3 0% glycerol were infused and growth 
continued for about 5 hours until cell concentration reached 
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an approximate OD 660 of 60. The culture was then chilled 
and cells were recovered by centrifugation. Fermentation of 
E. coli in the presence of any one of glucose, glycerol, 
galactose or a combination thereof as carbon source 
facilitated the expression of the SOD-proinsulin hybrid 
polypeptides . 

IV. Purification 

The SOD-proinsulin hybrid polypeptides expressed by plasmids 
pBAST-R and pDBAST-LAT accumulated in intracellular 
precipitate which was isolated by the following procedure: 
1 gr (wet weight) of bacterial cake was suspended in 10 ml 
buffer containing 50mM Tris-HCl, pH 8, lOmM EDTA and was 
treated with lysozyme (Merck, 2500 u/ml) at 37°C for 2 
hours. The mixture was then sonicated and Nonidet-P-4 0 
(Sigma) or Triton X 100 was added to a final concentration 
of 2% and stirred for 2 hours at room temperature. The 
precipitate was pelleted by centrifugation and washed with 
water. 

The hybrid polypeptide was purified to near homogeneity by 
anion exchange chromatography as follows. The precipitate 
was dissolved in 8M urea, 20mM Tris-HCl, 200tnM £- 
mercaptoethanol, pH 8.2. The solution was clarified by 
centrifugation and chromatographed on DEAE-Sepharose Fast- 
Flow column (Pharmacia LKB) , pre -equilibrated in 8M Urea, 
20mM Tris-HCl, 20mM fc-mercaptoethanol , pH 8.2. Flow- through 
material was collected and the hybrid protein was either 
precipitated with (NH 4 ) 2 S0 4 at 40% saturation or 
concentrated by ultrafiltration on 10K membrane followed by 
diaf iltration against lOOmM Glycine-HCl, pH 3.1. 

Alternatively, the SOD-proinsulin hybrid polypeptide 
expressed by plasmid pBAST-R was purified to near 
homogeneity by dissolution in 8M urea, 20mM Dithiothreitol , 
50mM NaAcetate, pH 5, and by ultrafiltration through a 
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series of lOOkD and 50kD membranes (Filtron) . The hybrid 
polypeptide was concentrated on a lOkD membrane and 
precipitated with (NH 4 ) 2 S0 4 at 40% saturation. 
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Foldina an d enzymatic cleavage of the SOD-proinsulin hybrid 
polypeptides 

5 

Proinsulin hybrid polypeptides, obtained by (NH 4 ) 2 S0 4 
precipitation or by ultrafiltration (Example 2) , were 
dissolved in 8M urea, 5mM HCl and diluted into lOOmM glycine 
buffer, pH 8.5-12.0 at a final concentration of about 1 
1 0 mg/ml . 

A. Folding of the SOD-proinsulin hybrid polypeptide 
expressed by plasmid pBAST-R took place at about 4-37°C 
for a period of about 1-24 hours in order to permit 
15 correct disulfide bond formation. 



The pH of the solution containing the folded, disulfide 
bonded hybrid polypeptide was adjusted to about 8.8-9.0 
with HCl and the protein was treated with trypsin and 
20 carboxypeptidase B at 16-37°C for 30-120 minutes. 

After considerable experimentation, it was found that 
the optimal conditions were as follows: The hybrid 
polypeptide expressed by plasmid pBAST-R was dissolved 

25 in 8M urea, 5mM HCl and diluted into lOOmM glycine 

buffer, pH 11.0 (Figure 8) at a final concentration of 
about 1 mg/ml, after which folding of the hybrid 
polypeptide took place for 6-16 hours at 25 °C, 
whereafter the folded, disulfide bonded hybrid 

30 polypeptide was cleaved with trypsin (1:500 w/w) and 

carboxypeptidase B (1:200 w/w) at 37°C for 3 0-60 
minutes . 



Insulin generation by enzymatic cleavage of the folded 
35 disulfide bonded proinsulin hybrid polypeptide 

expressed by plasmid pBAST-R is diagrammatically shown 
in Figure 1. 
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B. Folding of the SOD-proinsulin hybrid polypeptide 
expressed by plasmid pDBAST-LAT took place at about 7- 
31°C for a period of about 5-30 hours in order to 
permit correct disulfide bond formation. 

5 

The pH of the solution containing the folded, disulfide 
bonded hybrid polypeptide was adjusted to about 8.8-9.0 
with HC1 and the protein was treated with trypsin and 
carboxypeptidase B at 22-37°C for 30 minutes to 16 
10 hours. 

After considerable experimentation, it was found that 
the optimal conditions were as follows: The hybrid 
polypeptide expressed by plasmid pBAST-R was dissolved 

15 in 8M urea, 5mM HCl and diluted into lOOmM glycine 

buffer, pH 11-0-11.25 (Figure 8) at a final 
concentration of about 1 mg/ml, after which folding of 
the hybrid polypeptide took place for 5 hours at 2 5°C, 
whereafter the folded, disulfide bonded hybrid 

20 polypeptide was cleaved with trypsin (1:15.000 w/w) and 

carboxypeptidase B (1:10.000 w/w) at 25°C for 16 hours. 

Insulin generation by enzymatic cleavage of the folded 
disulfide bonded proinsulin hybrid polypeptide 
25 expressed by plasmid pDBAST-LAT is diagrammatically 

shown in Figure 2 . 



30 



Examples of specific conditions for both A and B above are 
detailed in the legends to Figures 8-14. 
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Protein analysis and purification of human insul in from the 

SQP-prQjngVlin hybrid polypeptide expressed bv plasmid 

5 PBAST-R 

Human insulin generation from the SOD-proinsulin hybrid 
polypeptide expressed by plasmid pBAST-R was determined by 
radioimmunoassay and RP-HPLC, utilizing commercial human 

10 insulin as standard (Calbiochem) . The theoretical yield of 

recombinant human insulin as calculated according to the 
amino acid sequence of the proinsulin hybrid polypeptide is 
45.6%. It is evident from Figure 8 that optimal folding 
occurs at pH 11. At this pH value, insulin production 

15 amounts to about 80% of the theoretical yield (which 

corresponds to about 40% of the input hybrid polypeptide) .. 
Human insulin produced from the proinsulin hybrid 
polypeptide expressed by plasmid pBAST-R, was detected by 
RP-HPLC. A Vydac 218TP54, 250 x 4.6 mm I.D. (Separation 

20 Group) , 5 /zm, 3 00 A pore size column was used at room 

temperature with a flow rate of 1 ml/min. 0.1% 
Trif luoroacetic acid (TFA) in H 2 0 was used as eluant A and 
0.08% TFA in acetonitrile as eluant B. The column was 
washed for 5 minutes in equilibration buffer (25% eluant B) 

25 followed by a linear gradient of 25-50% eluant B during 37.5 

minutes. Absorbance was monitored at 220 nm or at 260nm. 
Analysis of the human insulin following the enzymatic 
digestion of the folded, disulfide bonded hybrid polypeptide 
using Reverse Phase-High Pressure Liquid Chromatography 

30 revealed a major peak with the same retention time as 

standard human insulin. 

Two small scale batches were prepared yielding 26mg and 13mg 
of human insulin respectively. Human insulin was purified 
35 from the enzyme-treated solution (pH 9) by ultrafiltration 

on either 3K or 5K membranes (Filtron) followed by CM- 
Sepharose chromatography (citrate buffer, pH 3) . Peak 
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fractions. were desalted, lyophilized and subjected to N- 
terminal sequencing and amino acid analysis . The amino acid 
composition of both batches of recombinant human insulin was 
essentially identical to naturally- occurring human insulin 
5 (see Table 1, preparation 1) . The sequence of 5 amino acids 

at the amino terminus of the insulin preparations was 
determined by Edman degradation. It was found to be 
identical to the NH 2 -terminus of both the A and B chain of 
human insulin, which confirms the authenticity of the iii 
10 vitro product. 

However, the sequencing results indicated the presence of an 
extra Arg residue at the first position in about 25% of the 
molecules. This result corresponds to trypsin cleavage 
15 between Lys and Arg, inside the linker sequence Lys-Arg, 

thus leaving an additional Arg residue on the amino terminus 
of the A-chain. 

It was found that specific hydrolysis at the C- terminal of 
20 Arg by trypsin can be achieved by performing the reaction at 

pH 11. At this elevated pH, most of the e -amino groups of 
Lys are not charged (pK«10.3) thus enabling selective 
cleavage. Two batches yielding 1 mg and 6.5 mg of purified 
insulin were obtained by carrying out the trypsin step at pH 
25 11 (see Table 1, preparation 2) followed by carboxypeptidase 

B digestion at pH 8.5. N-terminal sequencing revealed that 
the amount of insulin comprising an extra Arg was reduced to 
about 5% . 
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TABLK 1 

Amino Acid Composition of Recombinant Human Insulin 



10 



15 



20 



25 



Amino 

Iri H 


Number of residues 


X liwUl CLlvuX 


O L. axiuai u 

Insulin 


PrAns T*"a t" ion 

* ^ CMQX A b» wit 
1 


Prpnarat ion 
2 


Asx 


3 


3.20 


3 .38 


3 .26 




Thr 


3 


2.98 


2 . 83 


2.68 


Ser 


3 


2.84 


2.53 


2 .77 


Glx 


7 


7.15 


7.73 


7.23 


Pro 


1 


1.28 


1.13 


1.09 


"77— 

Gly 


4 


4 .24 


4 .39 


4.25 


Ala 


1 


1. 00 


1.28 


1.04 




6 


5.88 


5.11 


5.79 


Val 


4 


3.82 


4.58 


3.88 


lie 


2 


2.04 


1.96 


1. 96 


Leu 


6 


5.87 


6.10 


5.99 


Tyr 


4 


3.80 


3.80 


3 .87 


1 Phe 


3 


3.15 


3.56 


3.03 


His 


2 


2.04 


2.05 


2.08 


Lys 


1 


1.01 


1.05 


1.02 


Arg 


1 


0.96 


1.30 


1.18 



Preparation 1 and 2 show the amino acid composition of 
recombinant human insulin produced from the proinsulin 
hybrid polypeptide expressed by plasmid pBAST-R. Trypsin 
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cleavage was carried out either at pH 9 (preparation 1) or 
at pH 11 (preparation 2) . 



Amino acid analysis was performed after performic acid 
oxidation and gas phase hydrolysis of purified insulin 
preparations . 
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Example 5, 

Peptide analysis of purified human insulin produced from the 
SOD-proinsulin hybrid polypeptide expressed bv plasmid 

PPAST-E 

Purified human insulin produced as described in the above 
Examples, was subjected to peptide analysis utilizing 
endoproteinase Glu-C (Sigma) , which hydrolyzes peptide bonds 
at the carboxyl side of glutamyl residues. 

In more detail, insulin samples (100 jig) , produced by 
cleavage of the folded, disulfide bonded proinsulin hybrid 
polypeptide expressed by plasmid pBAST-R, were digested with 
5 fig Glu-C for 6 hrs at 37°C in 100 /xl of 0.1 M Tris-HCl, pH 
7.8. HPLC analysis was performed: samples of commercially 
available (control) insulin and insulin produced by cleavage 
of the folded, disulfide bonded proinsulin hybrid 
polypeptide expressed by plasmid pBAST-R were acidified to 
a pH of about 3 and were separated by RP-HPLC. A Vydac 
218TP54, 250 x 4.6 mm I.D., 5/xm, 300 A pore size column was 
used. The column was equilibrated with 50 mM 

tetraethylammonium phosphate, 162mMNaC10 4 , pH3, containing 
31.5% (v/v) acetonitrile and was developed with a linear 
gradient of 35-45% acetonitrile during 75 minutes at a flow 
rate of 1 ml /minute . Absorbance was monitored at 220nm. 

All expected peptides were generated in agreement with the 
control reaction even though a minor shoulder following the 
peak corresponding to one of the fragments is probably 
related to des-Thr (B 30 ) insulin-like molecule (15) . 

Examples 4 and 5 indicate that the recombinant polypeptide 
expressed by plasmid pBAST-R comprises the sequence of 
naturally-occurring human insulin. A minor portion of the 
recombinant protein produced comprised forms such as 
Arg(Ao), desamido- or des-Thr (B 30 ) insulin-like molecules. 
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These unwanted by-products can be eliminated by 
chromatographic procedures such as RP-HPLC as described 
above . 
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Protein analysis and purification of human insulin produced 

from the prpAngujin hybrid PplypgPtidg expressed by plasma 
PDBAST-IAT 

In order to avoid generation of Arg(Ao) insulin by-product 
(Examples 4 and 5) , expression plasmid pBAST-R was modified 
to comprise DNA coding only for an Arg residue between the 
A and B chains of the proinsulin hybrid polypeptide as 
opposed to DNA coding for Lys-Arg between the A and B chains 
of the proinsulin hybrid polypeptide expressed by plasmid 
pBAST-R. This resulted in expression plasmids pDBAST-LAT 
(Example IB) and pA.BAST-LAT (Example 1C) . 

Efficient production of insulin occurred following folding 
and enzymatic treatment with trypsin and CPB of the folded, 
disulfide bonded proinsulin hybrid polypeptide expressed by 
new expression plasmid pDBAST-LAT. The presence of insulin- 
like contaminants was low (Figure 9) . Folding was optimal 
at pH 11.25 (Figure 10) and was significantly enhanced in 
the presence of about 2 moles ascorbic acid per mole SH 
group in the reaction mixture (Figure 11) . 

The effect of protein concentration on the yield of insulin 
produced from proinsulin hybrid polypeptide was determined 
in a series of reactions under otherwise optimal folding 
conditions. Optimal yields were obtained when protein 
concentration did not exceed 1.5 mg/ml (Figure 13). 

The insulin was purified by DEAE-Sepharose chromatography 
followed by RP-HPLC (as described in Figure 9) . As is 
evident from Figure 12, the recombinant human insulin 
produced had the same retention time as standard 
(commercially available) human insulin. The amino acid 
composition of the purified recombinant human insulin 
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preparation is identical to standard insulin (see Table 2, 
recombinant insulin) . 



Note that Table 2 indicates that the insulin produced from 
5 the proinsulin hybrid polypeptide expressed by plasmid 

pDBAST-LAT did not have the extra Arg residue attached to 
the insulin A chain (Arg (Ao) insulin) as described in Example 
4. Thus the preferred plasmid for production of insulin is 
plasmid pDBAST-LAT and the preferred sequence for the 
10 proinsulin hybrid polypeptide is that shown in Figure 7. 
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TABLE 2 



Amino Acid Composition of Recombinant Human Insulin 



10 



15 



20 



25 



Amino 

1 ACJLU 


Number of residues 


Theoretical 


Standard 
Insulin 


Recombinant 
insulin 


Asx 


3 


3,20 


3.32 


Thr 


3 


2.96 


2.73 




3 


2.84 


2.71 


Glx 


7 


7.15 


7.41 


Pro 


1 


1.28 


1.02 


Gly 


4 


4 . 24 


4 .46 


Ala 


1 


1. 00 


1.09 


Cys 


6 


5. 86 


5.28 


Val 


4 


3.82 


4 .00 | 


lie 


2 


2. 04 


1.91 


Leu 


6 


5.87 


6 .34 


Tyr 


4 


3.80 


3 .64 


Phe 


3 


3.15 


3 .06 


His 


2 


2.04 


2 .18 


Lys 


1 


1.01 


1 .02 


Arg 


1 


0. 96 


1 .07 



The amino acid composition of standard human insulin and 
recombinant human insulin produced from the proinsulin 
hybrid polypeptide expressed by plasmid pDBAST-LAT are 
shown . 
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Amino acid analysis was performed after performic acid 
oxidation and gas phase hydrolysis of purified insulin 
preparations. 



5 
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Example 2* 



Human insulin production from £h£ proinsulin hybrid 

polypeptide expyeggefl fry plasm^ pDBAST-frAT from cirude 

5 intracellular precipitate 

An improved method for folding and enzymatic conversion of 
the proinsulin hybrid polypeptide to insulin was carried out 
by using crude intracellular precipitate, omitting the need 

10 for the initial purification step as described in Example 2, 

part IV. Efficient production of insulin occurred following 
enzymatic cleavage of the folded, disulfide bonded 
proinsulin hybrid polypeptide with trypsin and 
carboxypeptidase B (Figure 14 and Table 3) . Insulin yields 

15 were calculated as the percent of initial protein 

concentration (A 2B0 ) as determined at the precipitate 
dissolution step at pH 12 (Figure 14) . Folding of SOD- 
proinsulin hybrid polypeptide from crude intracellular 
precipitate was shown to be optimal at about 4.5 hours from 

20 the start of the experiment (Figure 14) . 

Table 3 summarizes the partial purification of insulin from 
the proinsulin hybrid polypeptide expressed by plasmid 
pBAST-LAT from crude intracellular precipitate prepared from 

25 one liter fermentation culture at an O.D. 660 of 45. 

Dissolution and folding were carried out as described for 
Figure 14. At 4.5 hours from dissolution, the folded bulk 
solution including the folded, disulfide bonded proinsulin 
hybrid polypeptide was titrated to pH 8.8 with concentrated 

30 hydrochloric acid. ZnCl 2 (to SQ(M final concentration), 

carboxypeptidase B (1:4000 w/w) and trypsin (1:6000 w/w) 
were added. Digestion was performed for 3 hours at 37°C and 
was terminated by addition of phenylmethylsulf onyl fluoride 
(PMSF) to 0.5mM final concentration. Analysis by HPLC (as 

35 described in Figure 9) indicated an insulin yield of 169 mg. 

Insulin was purified by a sequence of anion-exchange and 
hydrophobic chromatographic steps . Digested folding mixture 
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was loaded on DEAE Sepharose Fast Flow (Pharmacia) column 
pre . equilibrated in 20mM Tris-HCl, 10mM NaCl pH 8 buffer at 
about 50 A 280 units per ml resin. Bound material was washed 
with 20mM Tris-HCl, lOOmM NaCl, pH 8 buffer and insulin 
5 eluted with 250mM NaCl in the same buffer. Pool fractions 

containing insulin represented 20% of loaded protein and had 
a purity of 37.1%. Ammonium sulfate was added to the DEAE 
elution pool to a concentration of 410 mM and was loaded on 
Phenyl -Sepharose Fast Flow column pre-equilibrated in 20mM 

10 Tris HC1, 540mM Ammonium sulfate at about 12 A 280 units per 

ml resin. Bound material was washed with equilibration 
buffer and insulin eluted with 20mM Tris HCl, 220mM ammonium 
sulphate, pH 8 buffer. Fractions containing insulin 
represented 42.3% of loaded protein and had a purity of 

15 74.1%. As a result of this partial purification process, 

120 mg insulin, identical to standard insulin, was produced 
which corresponds to an insulin yield of 5.16%. Further 
purification of insulin may be carried out by use of methods 
known in the art, e.g. gel filtration, RP-HPLC and 

20 crystallization (17) . 
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TABLE ? 



Purification of recombinant human insulin, produced from the 
proinsulin hybrid polypeptide expressed by plasmid pDBAST- 
LAT, following dissolution of crude intracellular 
precipitate, folding and enzymatic treatment with trypsin 
and carboxypeptidase B. 



10 



15 



20 



25 



fl 

J! Purification 
J step 


**2B0 


minimum amount 
of insulin by 
HPLC - in mg 


% fl 

puri [1 

ty J 


Precipitate 
dissolution 


232 
6 






Charcoal 
treatment 


191 
5 






Folding and 
enzymatic 
treatment 


191 
5 


169 


8 . 8 


DEAE - Sepharose 
pool 


383 


142 


37.1 


Phenyl - 
| Sepharose pool 


162 


120 


... 



A 280 represents the total absorbance at 280 nm at each 
purification step. Insulin presence was determined by 
HPLC analysis relative to standard insulin as described 
for Figure 9 and corresponds to the major insulin peak of 
standard insulin. 
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is claimed is: 

A method of producing insulin which comprises : 

(a) folding a hybrid polypeptide comprising 
proinsulin under conditions that permit correct 
disulfide bond formation; 

(b) subjecting the folded, disulfide bonded hybrid 
polypeptide to enzymatic cleavage to produce 
insulin ; 

(c) purifying the insulin. 

A method according to claim 1 wherein step (a) 
further comprises incubating the hybrid polypeptide 
at about 4-37°C for a period of about 1-30 hours at a 
pH of about 8.5-12.0, 

A method according to claim 2 wherein the incubation 
takes place in the presence of ascorbic acid. 

A method according to claim 2 wherein the pH is 11.0- 
11.25 . 

A method according to claim 3 wherein the pH is 11.0- 
11.25. 

A method according to claim 3 wherein the 
concentration of ascorbic acid is about 2 moles per 
mole SH group present in the folding mixture. 

A method according to claim 2 wherein the incubation 
period is about 5 hours. 

A method according to claim 3 wherein the incubation 
period is about 5 hours. 
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9. A method according to claim 1 wherein step (b) 
further comprises: 

(i) adjusting the pH to about 8.8-9.0; and 

(ii) cleaving the hybrid polypeptide with trypsin and 
carboxypeptidase B at 16-37°C for 3 0 minutes to 
16 hours. 

10. A method according to claim 1 wherein step (c) 
further comprises purification by means of DEAE- 
Sepharose chromatography and RP-HPLC. 

11. A method according to claim 1 wherein step (c) 
further comprises purification by means of 
ultrafiltration and CM-Sepharose chromatography. 

12. A method according to claim 1 wherein step (c) 
further comprises purification by means of DEAE- 
Sepharose chromatography and Phenyl -Sepharose 
chromatography . 

13. A method according to claim 1 wherein the proinsulin 
hybrid polypeptide is expressed by plasmid pDBAST-IAT 
deposited under ATCC Accession No. 693 61. 

14. A method according to claim 1 wherein the proinsulin 
hybrid polypeptide is expressed by plasmid plBAST-IAT 
deposited under ATCC Accession No. 69363. 

15. A method according to claim 1 wherein the proinsulin 
hybrid polypeptide is expressed by plasmid pBAST-R 
deposited under ATCC Accession No. 69362. 
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-so- 
le. A method according to claim 1 wherein the hybrid 
polypeptide of step (a) is obtained by treating a 
bacterial cell containing DNA encoding the hybrid 
polypeptide, so that the DNA directs expression 
5 thereof . 

17. A method according to claim 16 wherein the hybrid 
polypeptide is recovered from the cell. 

10 18. A method according to claim 16 wherein the treating 

comprises fermentation in the presence of glucose, 
glycerol or galactose. 

19. A method according to claim 17 wherein the recovery 
15 comprises: 

(i) disrupting the cell wall of the bacterial cell 
or fragments thereof to produce a lysate; 

20 (ii) isolating intracellular precipitate from the 

lysate by centrifugation; and 

(iii) solubilizing the precipitate. 

25 20. A polypeptide comprising proinsulin and a leader 

peptide attached to the N- terminus of the proinsulin, 
wherein the polypeptide is folded and contains 
correct disulfide bonds. 

3 0 21. A polypeptide according to claim 20 wherein the 

leader peptide is derived from the N- terminus of 
CuZnSOD . 

22. A polypeptide according to claim 21 wherein the 
35 leader peptide comprises 62 amino acids, being 

preceded by the amino acid Met and followed by an Arg 
residue. 
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23. A polypeptide according to claim 20 wherein the 

proinsulin comprises the insulin B-chain covalently 
attached to the insulin A-chain by a single Arg 



10 



24. A polypeptide according to claim 20 wherein the 
proinsulin comprises the insulin B-chain covalently 
attached to the insulin A-chain by the dipeptide Lys 
Arg. 

25. A polypeptide according to claim 20 wherein the 
cysteine residues of the leader peptide have been 
replaced by serine residues. 
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FIGURE 4 
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FIGURE 5 
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FIGURE 8 
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FIGURE 9 
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FIGURE 11 
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FIGURE 12 
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FIGURE 13 




SUBSTITUTE SHEET (RULE 26) 



A 



WO 96/20724 PCT/US94/13268 



14/14 



FIGURE 14 
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